AITopics | ld 2

Collaborating Authors

ld 2

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

b166b57d195370cd41f80dd29ed523d9-Supplemental.pdf

Neural Information Processing SystemsFeb-10-2026, 18:28:27 GMT

convergence, pd error, step size, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Russia (0.04)
Asia > Russia (0.04)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)

Industry:

Education (0.46)
Leisure & Entertainment (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.67)

Add feedback

7a53928fa4dd31e82c6ef826f341daec-Supplemental.pdf

Neural Information Processing SystemsFeb-9-2026, 01:34:00 GMT

algorithm, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts (0.04)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)
(2 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

A Detailed Theoretical Analysis

Neural Information Processing SystemsFeb-8-2026, 19:29:44 GMT

Curves only represents the process of the training phase.

artificial intelligence, machine learning, oom, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Scalable Heterophilous Graph Neural Network with Decoupled Embeddings

Neural Information Processing SystemsFeb-8-2026, 19:29:41 GMT

However, this assumption does not always hold in practice.

artificial intelligence, data mining, machine learning, (17 more...)

Neural Information Processing Systems

Country:

Asia > China > Hong Kong (0.04)
Oceania > Australia (0.04)
Asia > Singapore (0.04)
(2 more...)

Industry: Information Technology (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction Wei Jiang 1, Sifan Y ang 1,2, Wenhao Y ang

Neural Information Processing SystemsOct-9-2025, 23:46:26 GMT

Sign-based Stochastic V ariance Reduction (SSVR) method, which employs variance reduction estimators to track gradients and leverages their signs to update.

algorithm, convergence rate, optimization, (16 more...)

Neural Information Processing Systems

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > Middle East > Jordan (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)

Genre: Research Report > Experimental Study (0.92)

Industry: Information Technology (0.45)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Add feedback

Efficient Projection-Free Algorithms for Saddle Point Problems

Neural Information Processing SystemsOct-3-2025, 07:52:46 GMT

The Frank-Wolfe algorithm is a classic method for constrained optimization problems.

algorithm, ld 2, saddle point problem, (11 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts (0.04)
North America > Canada (0.04)
Asia > Middle East > Jordan (0.04)
(2 more...)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Decentralized sketching of low rank matrices

Rakshith Sharma Srinivasa, Kiryung Lee, Marius Junge, Justin Romberg

Neural Information Processing SystemsOct-3-2025, 04:58:56 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, matrix, (19 more...)

Neural Information Processing Systems

Country: North America > United States > Illinois (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Improved Analysis for Sign-based Methods with Momentum Updates

Jiang, Wei, Yu, Dingzhi, Yang, Sifan, Yang, Wenhao, Zhang, Lijun

arXiv.org Artificial IntelligenceJul-17-2025

In this paper, we present enhanced analysis for sign-based optimization algorithms with momentum updates. Traditional sign-based methods, under the separable smoothness assumption, guarantee a convergence rate of $\mathcal{O}(T^{-1/4})$, but they either require large batch sizes or assume unimodal symmetric stochastic noise. To address these limitations, we demonstrate that signSGD with momentum can achieve the same convergence rate using constant batch sizes without additional assumptions. Our analysis, under the standard $l_2$-smoothness condition, improves upon the result of the prior momentum-based signSGD method by a factor of $\mathcal{O}(d^{1/2})$, where $d$ is the problem dimension. Furthermore, we explore sign-based methods with majority vote in distributed settings and show that the proposed momentum-based method yields convergence rates of $\mathcal{O}\left( d^{1/2}T^{-1/2} + dn^{-1/2} \right)$ and $\mathcal{O}\left( \max \{ d^{1/4}T^{-1/4}, d^{1/10}T^{-1/5} \} \right)$, which outperform the previous results of $\mathcal{O}\left( dT^{-1/4} + dn^{-1/2} \right)$ and $\mathcal{O}\left( d^{3/8}T^{-1/8} \right)$, respectively. Numerical experiments further validate the effectiveness of the proposed methods.

artificial intelligence, machine learning, optimization problem, (18 more...)

arXiv.org Artificial Intelligence

2507.12091

Country: Asia > China (0.14)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)

Add feedback

Using Taylor-Approximated Gradients to Improve the Frank-Wolfe Method for Empirical Risk Minimization

Xiong, Zikai, Freund, Robert M.

arXiv.org Machine LearningNov-21-2023

The Frank-Wolfe method has become increasingly useful in statistical and machine learning applications, due to the structure-inducing properties of the iterates, and especially in settings where linear minimization over the feasible set is more computationally efficient than projection. In the setting of Empirical Risk Minimization -- one of the fundamental optimization problems in statistical and machine learning -- the computational effectiveness of Frank-Wolfe methods typically grows linearly in the number of data observations $n$. This is in stark contrast to the case for typical stochastic projection methods. In order to reduce this dependence on $n$, we look to second-order smoothness of typical smooth loss functions (least squares loss and logistic loss, for example) and we propose amending the Frank-Wolfe method with Taylor series-approximated gradients, including variants for both deterministic and stochastic settings. Compared with current state-of-the-art methods in the regime where the optimality tolerance $\varepsilon$ is sufficiently small, our methods are able to simultaneously reduce the dependence on large $n$ while obtaining optimal convergence rates of Frank-Wolfe methods, in both the convex and non-convex settings. We also propose a novel adaptive step-size approach for which we have computational guarantees. Last of all, we present computational experiments which show that our methods exhibit very significant speed-ups over existing methods on real-world datasets for both convex and non-convex binary classification problems.

artificial intelligence, frank-wolfe method, machine learning, (17 more...)

arXiv.org Machine Learning

2208.13933

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Filters

Collaborating Authors

ld 2

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

206191b9b7349e2743d98d855dec9e58-Paper-Conference.pdf

b166b57d195370cd41f80dd29ed523d9-Supplemental.pdf

7a53928fa4dd31e82c6ef826f341daec-Supplemental.pdf

A Detailed Theoretical Analysis

Scalable Heterophilous Graph Neural Network with Decoupled Embeddings

Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction Wei Jiang 1, Sifan Y ang 1,2, Wenhao Y ang

Efficient Projection-Free Algorithms for Saddle Point Problems

Decentralized sketching of low rank matrices

Improved Analysis for Sign-based Methods with Momentum Updates

Using Taylor-Approximated Gradients to Improve the Frank-Wolfe Method for Empirical Risk Minimization